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Abstract 

, A bounded degree structure is either a relational structure all of whose relations 

are of bounded degree or a functional structure involving bijective functions only. 
In this paper, we revisit the complexity of the evaluation problem of not necessarily 
Boolean first-order queries over structures of bounded degree. Query evaluation 
is considered here as a dynamical process. We prove that any query on bounded 
degree structures is CONSTANT-DELAY;,,,, i.e., can be computed by an algorithm 
that has two separate parts: it has a precomputation step of linear time in the size 
of the structure and then, it outputs all tuples one by one with a constant (i.e. 
depending on the size of the formula only) delay between each. Seen as a global 
process, this implies that queries on bounded structures can be evaluated in total 
{Sj | time 0(/(|</?|).(|<S| + \ip(S)\)) and space 0(f(\ip\).\S\) where S is the structure, (p is 

the formula, <p(S) is the result of the query and / is some function. 

Among other things, our results generalize a result of [See96] on the data com- 
plexity of the model-checking problem for bounded degree structures. Besides, 
\ the originality of our approach compared to that [See96] and comparable results 

"■^ ■ is that it does not rely on the Hanf's model-theoretic technic (see [Han65]) and is 
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completely effective. 

Introduction 
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Evaluating the expressive power of logical formalisms is an important task in theoret- 
ical computer science. It has many applications in numerous fields such as complexity 
theory, verification or databases. In this latter case, it often amounts to determine how 
difficult it is to compute a query written in a given language. In this vein, determining 
which fragments of first-order logic defines tractable query languages has deserved much 
attention. 

It is well known, that over an arbitrary signature, computing a first-order query can 
be done in time polynomial in the size of the structure (and even in logarithmic space 
and AC ). However the exponent of this polynomial depends heavily on the formula 
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size (more precisely, on the number of variables). Nevertheless, for particular kinds of 
structures or formulas the complexity bound can be substantially improved. In [See96], 
it is proved that checking if a given first-order sentence <p is true (i.e., the Boolean query 
or model-checking problem) in a structure S all of whose relations are of bounded degree 
can be done in linear time in the size of S. The method used to prove this result relies 
on old model-theoretic technics (see [Han65]). It is perfectly constructive but hardly 
implement able. Later, still using such kind of methods, several other tractability results 
have been shown for the complexity of the model-checking of first-order formulas over 
structures or formulas that admit nice (tree) decomposition properties (see [FFG02]). 
In this paper, a bounded degree structure is either a relational structure all of whose 
relations are of bounded degree or a functional structure involving bijective functions 
only. 

The main goal of this paper is to revisit the complexity of the evaluation problem of not 
necessarily Boolean first-order queries over structures of bounded degree. We regard 
query evaluation as a dynamical process. Instead of considering the cost of the evaluation 
globally, we measure the delay between consecutive tuples, i.e., query problems are 
viewed as enumeration problems. This latter kind of problems appears widely in many 
areas of computer science (see for example [EG95, EGM03, BGKMOO, KSSOO, Gol94] 
or [JYP88] for basic complexity notions on enumeration). However, to our knowledge, 
relation to query evaluation has not been investigated so far. 

We prove that any query on bounded degree structures is CONSTANT-DELAY/j n , i.e., 
can be computed by an algorithm that has two separate parts: it has a precomputation 
step whose time complexity is linear in the size of the structure and then, outputs all 
the solution tuples one by one with a constant (i.e., depending on the size of the formula 
only) delay between two successive tuples. Seen as a global process, this implies that 
queries on bounded structures can be evaluated in total time 0(/(|(/9|).(|5| + |(/?(5)|)) and 
space 0(/(|y?|).|«S|) where |«S| is the size of the structure S, \ip\ is that of the formula ip, 
\f(S)\ is the size of the result <p(S) of the query and / is some function. As a corollary, 
it implies that the time complexity of the model-checking problem is 0(/(|</?|).|<S|) thus 
providing an alternative proof of the result of [See96] . 

A particularity of the main method used in this paper is that it does not rely on model- 
theoretic technic as previous results of the same kind (see, for example, [See96] or [Lin04] 
for a generalization to least-fixed point formulas). Instead, we develop a quantifier elimi- 
nation method suitable for bijective unary functions and apply it to obtain our complex- 
ity bound. An advantage of this method is that it is effective and easily implementable. 
Another advantage is that our paper is completely self-contained. 

Besides, the CONSTANT-DELAY/j n class is an interesting notion by itself and is, to our 
knowledge, a new complexity class for enumeration problems: as proved for linear time 
complexity (the class DLIN studied in [GS02]) it can be shown that CONSTANT-DELAY;,^ 
is a robust class and is in some sense the minimal robust complexity class of enumeration 
problems. 

The paper is organized as follows. First, basic definitions are given in Section 1. In 
particular, in Subsection 1.3, we recall definitions about enumeration problems and in- 
troduce the notion of constant delay computation and prove some basic properties about 
it. In Section 2, the quantifier elimination method is introduced and is applied to the 
evaluation problem of first-order formulas over functional structures all of whose func- 
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tions are bijective. In Section 3, using classical logical interpretation technics, this later 
problem is reduced in linear time to the first-order query problem over structures of 
bounded degree thus providing the same bound for it. Finally, in Subsection 3.3, con- 
sequences about the complexity of the subgraph (resp. induced subgraph) isomorphism 
problem are given. 

1 Definitions 



1.1 Logical definitions and query problems 

We suppose the reader to be familiar with basic notions of first-order logic. A signature 

a is a finite set of relational and functional symbols of given arities (0-ary function 

symbols are constants symbols). The arity of a is the maximal arity of its symbols. The 

set a is called unary functional if all its symbols are of arity bounded by one. 

A (finite) a-structure consists of a domain D together with an interpretation of each 

symbol of a over D (the same notation is used here for each signature symbol and its 

interpretation). 

In this paper, we will distinguish between two kinds of signatures on which semantical 
restrictions on their possible interpretation are imposed: 

• Either a is made of constant and monadic (i.e., unary) relation symbols and unary 
function symbols whose interpretation is taken among bijective functions (i.e., 
permutations) only, 

• Or a contains relation symbols only whose degrees are bounded by some given 
constant (detailed definitions about bounded degree relations are delayed till sec- 
tion 3). 

Structures defined by either of semantical restrictions will be called bounded degree struc- 
tures. 

In what follows we make precise notions and problems about first-order logic over bijec- 
tive structures. 

Definition 1 Let a = {c, U, /i, . . . , fa} be a signature consisting of constant symbols Ci G c, 
of monadic predicates Ui € U and of unary function symbols fi,i = 1, . . . , k. A bijective 
a-structure is a a-structure S of the form S = (D;c,U, /i, . . . , /&) where each fi is a permu- 
tation on domain D. 

One of the main results of this paper provides a quantifier elimination method over 
bijective structures. As it is usual for such kind of result, the elimination will be done 
in a richer language. The following definition is required. 

Definition 2 A bijective term t(x) is of the form fi 1 ... fi'(x) where I > 0, x is a variable 
and where each is either the function symbol fi or its reciprocal ff . The term t~ 1 (x) 
denotes the reciprocal of the term t(x). 

A bijective atomic formula is of one of the following four forms where t(x) and ti(x) are 
bijective terms: 
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• either a bijective equality t(x) = Ti(y), 

• or t(x) = c where c is a constant symbol, 

• or U(t(x)) where U is a monadic predicate, 

• or a cardinality statement 3 k ,^/(x) where the quantifier 3 k is interpreted as "there exist 
at least k values of x such that" and ^> is a Boolean combination of bijective atoms a(x) 
over variable x only. 

As the reciprocal of each function symbol can be used, each bijective equality t(x) = 
Ti(y) can be rephrased as T2(x) = y where T2(x) = r 1 _1 r(x). A bijective literal is a 
bijective atomic formula or its negation. 

Definition3 The set FOBij of bijective first-order formulas is the set of first-order formulas 
built over bijective atomic formulas of some unary signature a. 

Let t = (t\, . . . ,tk) be a A;-tuple of variables and ip(t) and (f'(t) be two a-formulas with 
free variables t. Formulas <p(t) and p>'(t) are equivalent if for all a-structures S and all 
tuples a of element of the domain with \a\ = \t\ it holds that: 

(S,a) \=<p@ iff (S,a)\=ip'(t). 

In this paper query problems are considered for specific classes of first-order formulas 
(and structures). One of the specific problems under consideration here is the following. 

Query (FO Bij ) 

Input: a unary functional signature a, a bijective u-structure S and a first-order bijective 
cj-formula <p(x) with k free variables x = (x\, . . . , x^j 
Parameter: ip 

Output: ip(S) = {a£D k : (S,a) \= <p(x)}. 

The Boolean query problem (the subproblem where k = 0) is often called a model- 
checking problem. It will be denoted by MC (FOBij) here. As suggested by the formu- 
lation of the query problem, we are interested in its parameterized complexity and the 
complexity results given here consider the size of the query formula (p as the parameter 
(see [DF99]). 

1.2 Model of computation and measure of time 

The model of computation used in this paper is the Random Access Machine (RAM) 
with uniform cost measure (see [AHU74, GS02, GO04, FFG02]). As query problems 
are the main subject of this paper, instances of problems always consist of two kinds of 
objects: first-order structures and first-order formulas. 

The size \I\ of an object / is the number of registers used to store I in the RAM. If E 
is the set [n], \E\ = card(E) = n. If R C D k is a fc-ary relation over domain D, with 
\D\ = card(D), then \R\ = k.card(R): all the tuples (x±, . . . ,Xk) for which R(x\, . . . ,Xk) 
holds must be stored, each in a separate /c-tuple of registers. Similarly, if / is a unary 
function from D to D, all values f(x) must be stored and |/| = \D\. 
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If cp is a first-order formula, \<p\ is the number of occurrences of variables, relation or 
function symbols and syntactic symbols: 3, V, A, V, =,"(",")","," . For example, if 
ip = 3x3y R(x, y) A -<(x = y) then \(p\ = 17. 

All the problems we consider in this paper are parameterized problems: they take as 
input a list of objects made of a cr-structure S and a formula tp and as output the result 
of the query size <p(S). Due to the much larger size, in practice, of the structure S 
than the size of formula <p, \S\ >> \<p\, this latter one, \<p\ , in considered here as the 
parameter. 

A problem P is said to be computable in time /(|y|).T(|5|, |<^(<S)|) for some function 
/ : N — > R + if there exists a RAM that computes P in time (i.e., the number of 
instructions performed) bounded by /(|y?|).T(|5|, |<£>(cS)|) using space, i.e., addresses and 
register contents also bounded by /(|</?|).T(|<S|, |y>(<S)|). The notation {p {T{\S\, \ip{S)\)) 
is used when one does not want to make precise the value of function /. It is also assumed 
that the function T is at least linear and at most polynomial, i.e., T(n,p) = £l(n + p) 
and T(n,p) = (n+p) ^. To give an example and to relate our complexity measure to 
the logarithmic cost measure, in case T is linear, i.e., T(n,p) = n + p, the number of 
bits manipulated by the RAM is well linear in the number of bits needed to encode the 
input and the output. 

1.3 Enumeration algorithms and constant delay computation 

In this section, A is a binary predicate. Enumeration problems will be defined by 
reference to such a predicate. 

Definition 4 Given a binary relation A, the enumeration function Envm-A associated to A 
is defined as follows. For each input x: 

ENUM'i(x) = {y : A(x,y) holds } 

Remark 1 Query problems may evidently be seen as enumeration problems. The input x is 
made of the structure S and the formula <p(x), a witness y is a tuple a and evaluating predicate 
A amounts to check whether (S,a) \= <p(x). 

One may consider the delay between two consecutive solutions as an important point 
in the complexity of enumeration problems. In [JYP88] several complexity measures for 
enumeration have been defined. One of the most interesting is that of polynomial delay 
algorithm. An algorithm A is said to run within a polynomial delay if there is no more 
than a (fixed) polynomial delay between two consecutive solutions it outputs (and no 
more than a polynomial delay to output the first solution and between the last solution 
and the end of the algorithm). Polynomial delay is often considered as the right notion 
of feasability for enumeration problems. 

In this paper, we introduce a much stronger complexity measure that forces constant 
delay between outputs. 
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Definition 5 An enumeration problem EnunlA is constant delay with linear precomputation, 
which is written Enum-^4 e CONSTANT-DELAY Zin/ if there exists a RAM algorithm A which, 
for any input x, enumerates all the elements of the set Enum-^4(x) with a constant delay, i.e., 
that satisfies the following properties. 

1. A uses linear input space, i.e., space 0{\x\) 

2. A can be decomposed into the two following successive steps 

(a) precomp(*4) which runs some precomputations in time 0(\x\), and 

(b) enum(^4) which outputs all solutions within a delay bounded by some constant 
DELAY (.4). This delay applies between two consecutive solutions and after the last 
one. 

Allowing polynomial time precomputations (and polynomial space) instead of linear 
time, one may define a larger class called CONSTANT-DELAYpo^. 

Remark 2 As proved for the linear time class DLIN (see [GS02]), it can be shown that the 
complexity enumeration class CONSTANT-DELAY/j n is robust, i.e., is not modified if the set 
of allowed operations and statements of the RAMs is changed in many ways. This is because 
linear time (and linear space) precomputations give the ability to precompute the tables of new 
allowed operations. 

The following result is immediate, it evaluates the total time cost of any constant delay 
algorithm. 

Lemma 1 Let ENUMv4 be an enumeration problem belonging to Constant-Delay^ then, 
for any input x, the set Enum- j 4(x) can be computed in 0(\x\ + |ENUM- total time, 
i.e., in time linear in the size of \Input\ + \Output \, and linear input space 0(\x\). 

Remark 3 In the query problem we consider, the size of ip is considered as a parameter. Then, 
\x\ = \S\ and the constant delay depends on only. 

The two lemmas below give basic properties of constant delay computations. 

Lemma 2 An enumeration problem Enum-j4 computable in linear time O(\x\)for any input 
x belongs to Constant-Delay^ . 

Proof. For any input x, one only has to compute the set ENUM-^4(x), to sort it and 
to eliminate the possible multiple occurrences of solutions. These steps can be viewed 
as the precomputation part of the algorithm running in time 0(|x|). Then, one has to 
enumerate one by one the solutions of the sorted list. This is obviously a constant delay 
process. □ 

Lemma 3 Let Enum-j4 and Enum-I? be two disjoint enumeration problems, i.e., such that, 
for any input x, Enum-j4(x) n Enum-B(x) = 0. Let Enum-(^4 U B) be the union of this tivo 
enumeration problems defined by, for any x: 
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ENUM-(A U B)(x) = {y : A(x,y) or B(x,y) holds }. 



TfENUM-^ and Enum-5 belong to Constant-Delay^ then, problem Enum-,4 U B also 
belongs to CONSTANT-DELAY^ . 

Proof. Due to the disjointness of the two solutions sets for any input, the proof is 
evident. Given A and B the algorithms for problems Enum-j4 and ENUM-i?, the 
following algorithm correctly computes for the problem ENUM-yl U B. 

Algorithm 1 Constant delay algorithm for Enum-AU-B 
1: Input: x 

2: PRECOMP(^4); PRECOMP(£?) 
3: ENUM(y4); ENUM(£>) 



Obviously, the delay is bounded by the maximum of DELAY (.4) and DELAY (B). □ 

Remark 4 Note that the disjointness condition in the Lemma above is not always necessary. 
In case there exist a total ordering < and constant delay enumeration algorithms for Envm-A 
and Enum-S that enumerate solutions with respect to this unique ordering < then, it is easily 
seen that Enum -AuB belongs also to Constant-Delay^ even if the problems are not 
disjoints. 

2 First-order queries on bijective structures 



2.1 Quantifier elimination on bijective structures 

The key result of this paper consists of a quantifier elimination method for FC»Bij for- 
mulas. 

Theorem 4 (quantifier elimination for FOeij) Each bijective first-order formula is equiv- 
alent to a Boolean combination of bijective atomic formulas. More precisely, let ip(t) £ FC»Bij 
with free variables t then, there exists a Boolean combination of bijective atomic formulas <p'(t) 
over the same free variables t equivalent to <p(t). 

In the special case where <p is closed (i.e., without free variable) then, ip is equivalent to a Boolean 
combination of cardinality statements. 

Proof. As Vxip = —i(3x—>ip), we only have to consider elimination of existentially 
quantified variables. W.l.o.g., we consider formulas in disjunctive normal form and, 
as existential quantifier commutes with disjunction we may consider the case of the 
elimination of a single existentially quantified variable y in a formula of the form: 

(p(x) = 3y (cti A ... A Or) (1) 
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where each a, is a bijective literal among variables x and y. Literals depending on x 
only and cardinality statements need not be considered since they do not involve y, so 
<p(x) may be supposed of the following form: 

(p(x) = 3y [if>(y) A y = ei n{x^) A ... Ay = <Lk r k (x ik )] (2) 

where each y = e . Tj{xi j ) with ej = ±1 is y = Tj(xi-) if ej = 1 or y ^= Tj(xi j ) if ej = — 1. 
To eliminate quantified variable y two cases may happen. 

Suppose first there is at least one index j such that ej = 1. In this case, the equality 
y = Tj{xi j ) is used to replace each occurrence of y in the formula by the term Tjix^). 
The process results in a new formula <p'(x) without variable y. 

The second possibility leads to a more complicated replacement scheme. Suppose that 
for every j, ej = —1. Then, 

<p(x) = 3yliP(y)A/\y^Tj(xj)] (3) 

3<k 

(For simplicity of notations but w.l.o.g. we have supposed that ij = j for j = 1, . . . , k). 
The basic idea is now the following : suppose h < k is the number of distinct values 
among the k terms Tj(xj) such that ip(Tj(xj)) is true; then, formula (p(x) is true if and 
only if the number of y such that i/j(y) holds is strictly greater than h (i.e., 3y +1 ifj(y) is 
true). Introducing (new) cardinality statements in the formula, (p(x) can be equivalently 
rephrased as the following Boolean combination of bijective atomic formulas: 



v&)= V V 

h=0 PC[k] s QCP,\Q\=h 

A ^(rjixj)) A3^(y)\ 
j£Q iePjeQ je[k]\P 

(4) 

where [k] = {1, . . . , k}. 

More generally, starting from a prenex bijective first-order formula ip(t) with free vari- 
ables t, one eliminates all quantified variables from the innermost to the outermost one. 
This will result in an equivalent Boolean combination of bijective atomic formulas over 
t. In the case where (p is without free variable (i.e., t is empty), it is easily seen that 
the elimination process results in a Boolean combination of cardinality statements (note 
that, of course, 3x(p(x) = 3\ip(x)). □ 

One interesting consequence of Theorem 4 is the following result. 

Corollary 5 (Seese [See96]) The problem MC(FOBij) is decidable in time O v (\S\). 

Proof. From Theorem 4, we know that there exists a Boolean combination of cardinality 
statements over the same signature a equivalent to Given a formula 3^(x) one can 
test whether a given a-structure S satisfies S \= 3^(x) in time Oxp-(|*S|): it suffices to 
enumerate all the elements a of the domain, test whether (S,a) \= ^>(x) in constant 
time and count those for which the answer is positive. If this number is greater than 
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or equal to k then 3^(x) is true in S. The final answer for 3> is given by the boolean 
combination of the answers for each cardinality statement. □ 



2.1.1 Considerations on an efficient implementation of the algorithm 

Compared to the method of [See96] , the proofs given in this paper are constructive and 
easily implementable. But, due to the case of Formula 3 in Theorem 4 which leads to the 
equivalent Formula 4 the whole process is in 0^(151) = 0(/(|y>|).|«S|) for some function 
/ that may be a tower of exponentials. It can be shown that it heavily depends on the 
number of variables and of quantifier alternations of the formula. However, the size of 
the function / can be substantially reduced in case there are few quantifier alternations. 
In what follows, we revisit the method of the proof of Theorem 4 to prove a slightly 
different result in a specific case. We focus on formulas with existentially quantified 
variables only and show that the model-checking problem for such formulas can be 
efficiently evaluated. A FOBij formula is in £i— FOBij if it is of the form: 

3y ip 

where ip is quantifier-free and in disjunctive normal form (DNF). 

Corollary 6 The model-checking problem for Si — FO-Bij formulas can be evaluated in time 
0(\ip\ d .\S\) where d is the number of distinct variables of (p. 

Proof. The result obviously holds for d = 1. So, assume d > 1. For the same reason as 
in Theorem 4, we may consider any formula of the form: 

ip(x) = 3y (0:1 A ... A a r ) (5) 

where each cti is a bijective literal 1 with variables among x and y. For sake of com- 
pleteness here, we consider also terms not containing y. Then, <p(x) is of the form: 

tp(x) = 3y [il>(y) A y = ei A ... Ay = efc T k {x ik ) A j(x)] (6) 

with the same notation €j as in the proof of Theorem 4 and 'y(^) involves variables of x 
only. Again, if €j = 1, for some j, then all the occurences of y are replaced by Tj{xi j ) 
and <p(x) is equivalent to a conjunction of literals without variable y. 
Suppose now that €j = —1 for all j < k. Let A = {a £ D : (5, a) \= ijj(y)}. Since i^{y) is 
quantifier-free, A can be computed in time 0(|^|.|<S|). Two cases need to be considered 
now. If \A\ > k, since there are at most k different values Tj(xj) for j = 1, . . . , k, then 
the conjunction 3y[ip(y) Ay ^ Tiix^) A ... At/ / T k(xi k )] is always true and (p(x) is 
simply equivalent to "y(x). If \A\ < k let A = {ai, . . . , a^}, with h < k. Formula tp(x) is 
replaced by the equivalent formula below over the richer signature a U {ai, . . . ,a^}: 

V(A a ^ r iK) A 7(^)) 

i<h j<k 

1 In this proof, bijective literals do not involve cardinality statements 
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In all cases, the formula obtained is also in DNF. Time 0(|<£>|.|<S|) is needed to eliminate 
variable y and the new formula is of size bounded by 0(k.\(p\), i.e., less than 0(|y| 2 ). 
Elimination of all the d existentially quantified variables except the last one can be 
pursued from this new formula (without need for a normalisation). In the worst case 
(where all literals are of the form Xi ^ ri(xj)), the process will result in a disjunction 
of less than l^l^" 1 conjunctions of at most \ip\ literals. □ 

2.2 Constant delay algorithm for first-order queries on bijective struc- 
tures 

We are now ready to state the main result of this section. 

Theorem 7 The problem Query (FO B y) G Constant-Del AY Z in . In particular, from 
Lemma 1, it can be computed in time O v (\S\ + \ip(S)\) and space O^dSj). 

Definition 6 A bijective literal is a bijective atomic formula or its negation. 

Before proving Theorem 7, we establish the following lemma. 

Lemma 8 Let Sbea bijective structure and <f be a conjunction of bijective literals. Computing 
query S h-> 9(S) can be done in CONSTANT-DELAY/j n . 

Proof. The result is proved by induction on k the number of free variables of ^f(x) 
where x = (xi, . . . , Xk)- We even assume that ^> makes use of explicit constants from 
domain D of S. 

For the case k = 1, it is evident that the one variable query Q = {a € D : (S, a) \= 
ty(x)} can be evaluated in time 0^(|D|) = 0^(|«S|) and hence, by Lemma 2, is in 
Constant-Delay^ . 

The result is supposed to be true for k (k > 1) and proved now for k + 1. Let's consider 
the query: 

Q = {(a,b) £D k+1 :S\=*(x,y)} 

where the conjunction of bijective literals is over variables x = (x±, . . . , x^) and y. As 
for Theorem 4, two cases need to be distinguished. 

1. Vl/ contains at least one literal of the form T\(y) = T2(xi ), 1 < iq < k, that can 
also be rephrased as y = r(xi ), 

2. ^ does not contain such a literal. 
In the first case, ^ can rewritten as: 

= $o(x,y) Ay = r(x io ). 

Query Q is then equivalent to: 

Q = {(a,r(a i0 )) G D k+1 : (S,a) \= ^> (x,r(x io ))}, 
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which is essentially the following k variable query Q': 

Q' = {a€D k : (5, a) \= * (x, r(x io )}. 

To be precise, Q = {(a, r(aj )) : a E Q'}. By the induction hypothesis, query Q' can be 
computed by some algorithm A 1 in constant delay. This provides the following constant 
delay procedure for query Q. 

Algorithm 2 Evaluating query Q 
1: Input: 5,* 

2: PRECOMP(^4 / ) 

3: Apply enum(^4') and for each enumerated tuple a, output (a, r(aj )) instead 



Case 2 is a little more complicated. Formula \P can be put under the following form: 

* = *x(x) A* 2 (y) A /\ y^nixji) 

l<i<r 

with 1 < ji < k for 1 < i < r. By induction hypothesis, the A; variable query: 

Q 1 = {o€D fc :(5,o)|=« r l(^)} 

can be computed by an algorithm Ai on input 5 with constant delay. For similar reason, 
the k variable query Q b over structure (<S, b) defined by: 

Q b = {a £ D k : (S,a,b) ^ V(x,y)}} 

can be enumerated by an algorithm using constant delay. Let now Q2 be: 

Q 2 = {b£D: (5,6) h*2(y)}- 

If IQ2I < r then, by Lemma 3, there exists an algorithm Aq which enumerates the 
disjoint union Ub & Q 2 Qi, x {b} with constant delay. Note that L>b € Q 2 Qb x {b} = Q. From 
what has been said Algorithm 3 below correctly computes query Q. 

Up to step 5 of the algorithm, all can be done in linear time. 

It remains to show that, in the case where IQ2I > r + the delay between two successive 
solutions is bounded by some constant. Since IQ2I > r + 1 and the number of b E Q2 
that verify (S,a,b) \/= \Ji< i<r y = Ti(xj t ) is bounded by r, the algorithm outputs at 
least one (a, 6) for each a E Q±. More precisely, it outputs IQ2I — r such tuples. For 
the same reasons, the maximal delay between two successive outputs is then bounded 
by 2r. The same arguments apply for the delay between the last solution and the end 
of the algorithm. Then, computing Q can be done in constant delay. □ 

Proof of Theorem 7. Let 5 and ip(x) be instances of the QUERY (FOBij) problem. From 
Theorem 4, one can transform (p(x) into the following equivalent formula in disjunctive 
normal form: 

<f(x) = ^1(2) V ... V y q (x) 

where each is a conjunction of bijective literals and for all 1 < i < j < q and 
all bijective structures 5, ^i(S) n *&j(S) = 0. The Theorem immediately follows from 
Lemma 3 since the enumeration problem of each query 5 1— > \Pj(5), 1 < i < q, belongs 
to Constant-Del ay^„ by Lemma 8. □ 
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Algorithm 3 Evaluating query Q 



1: Input: 

2: Compute Q2 and IQ2I 
3: if IQ2 1 < r then run Aq 
4: else 

5: PRECOMPf^x) 

6: for a € ENUM(^i) do 

7: for 6 e Q2 do 

8: if {S,a, b) \f= Vi<i<r V = T i(xji) then Output (a, 6) 

9: end if 

10: end for 

11: end for 
12: end if 



3 Relational structures of bounded degree 



3.1 Two equivalent definitions 

Let p = {Ri, . . . , R q } be a relational signature, i.e., a signature made of relational 
symbols Ri each of arity a^. Recall that arity(p) = maxi<i< q (ai) = m. 
Let S = (D; R±, . . . , R q ) be a p-structure. For each i < q, Ri C D a \ The degree of an 
element x in S is defined as follows: 

degree s {x) = ^ ^ tf{(yi, . . . ,y ai ) € £> ai : 3j < a* s.t. x = yj and 5 |= ^(yi, . . . , y a . 

Intuitively, degrees (x) is the total number of tuples of relations Ri to which x belongs 
to. One defines the degree of a structure as degree(S) = max x< z£,{degrees{x)) . 

Remark 5 In [See96] a different definition of the degree of a structure is given. It counts, for 
each x, the number of distinct elements y 7^ x adjacent to x, i.e., that appear in some tuple with 
x. More precisely, 

degree s (x) = ${v '■ y ^ x an & 3i < q,t £ D a \ s.t. S \= Ri(t) and x, y € t}, 
and degree 1 (S) = max x£ £)(degree s (x)). 

Since each tuple containing x contains at most m — 1 elements different from x, it is easily seen 
that: 

degree 1 (S) < (m — l).degree(S) where m = arity(p). 

Conversely, for each x, if there exist at most d elements y e D adjacent to x then, the number 
of distinct tuples involving x and y is bounded by q.m.d 171 ' 1 . Hence, 

degree{S) < q.m.{degree 1 {S)) m ~ 1 . 
So, the two measures yield the same notion of bounded degree structure. 
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We are interested in the complexity of the following query problem for bounded degree 
structures (which is clearly independent of either measure of degree we choose). 



Query (FO Deg ) 

Input: an integer d, a relational signature p, a p-structure S with degree{S) < d and a 
first-order p- formula f(x) with k free variables x = (x%, . . . , x^) 
Parameter: d, ip 

Output: tp(S) = {aeD k : (S,a) \= cp(x)}. 

3.2 Interpreting a structure of bounded degree into a bijective struc- 
ture 

In this section, we present a natural reduction from QUERY (FODeg) to QUERY (FOeij) 
which is obtained by interpreting any structure of bounded degree into a bijective one. 
Let S = (D; R±, . . . , R q ) be a p-structure of domain D, of arity m = maxi<i< q arity{Ri) 
and of degree bounded by some constant d. One associates to S a bijective c-structure 
S 1 = {D'\ D, Ti, . . . , T g ,g, f\, . . . , f m ) of domain D' where D,T±, . . . ,T q are pairwise 
disjoints unary relations (i.e. subsets of D') and g, fi, . . . , f m are permutations of D' . 
Structure S' is precisely defined as follows: 

• D corresponds to the domain of S. 

• Ti (1 < i < q) is a set of elements each representing a tuple of Ri (hence, card(Ti) = 
card(Ri)). 

The new domain D' is the disjoint union: D U (D x {1, . . . , d}) U T\ U . . . U T q . 
Let us use the following convenient abbreviations: U = D U (D x {1, . . . , d}) and 
^ = Ui<i< g Ti- 

• g creates a cycle that relates d copies of each element x of the domain. More 
precisely, for each x G D, it holds g{x) = (x, 1), g((x, i)) = (x,i + 1) for 1 < i < d, 
and g((x,d)) = x. We also set g(x) = x for all other x (x € T). 

• Each fi is an involutive permutation and essentially represents a projection of T 
into D as follows. Let Ri(x\, . . . , x^) be true in <S for some relation Ri of arity 
k < m and some /c-tuple (xi, . . . , Xk) € D k . Suppose Ri(xi, . . . , Xk) is represented 
by element t £ Ti, then, for each j < k, set fj(t) = (xj,h) and set the reciprocal 
f((xj,h)) = t if R(x±, . . . ,Xk) is the h th tuple in which Xj appears (with h < d). 
The construction is completed by loops fj(x) = x for all other x£D'. 

Figure 1 details the reduction on an example. 

It is clear that, by construction, S' is a bijective structure and that we have the following 
interpretation Lemma. 

Lemma 9 Let Oi be the a-formula below associated to any symbol Ri e p of arity k: 
ei( Xl ,...,x k ) = 3t(Ti(t)A /\ \/ /,(*)= g h ( Xj )). 

l<i<fc l<h<d 
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Figure 1: Our reduction on an example: the original structure (digraph) of degree 3 is 
on the right side of the picture 

Then, for all {a\ . . . , a k ) £ D k : 

(5,oi,...,Ofc) \= Ri(xi,...,x k ) (S',ai,...,a k ) \=9i(xi,...,x k ). 

To each first-order p-formula ip{x\, . . . , x p ), one associates the c-formula <p"{x\, . . . , x p ) 
obtained by replacing each quantification 3v (resp. Vu) by the relativized quantifica- 
tion (3vD(v)) (resp. (VvD(v))) (that can be written respectively as 3v(D(v) A ...) and 
Vv(D(v) — ► ...)) and by replacing each subformula Ri(x\, . . . , x k ) by 9i(xi, . . . , x k ). 
The following proposition and lemma express that our reduction is correct and linear in 
\S\. Because of Lemma 9, Proposition 10 can be easily proved by induction on formula 
ip. 

Proposition 10 (interpretation of S into S') For all (x\ . . . , x p ) € D p : 

(S,ai,...,a p )\=(p(x 1 ,...,x p ) (S',ai,...,a p )\=ip"(xi,...,x p ). 

In other words: <p(S) = <p"(S') fl D p . Then, setting ip'(xi, . . . ,x p ) = <p"(xi, . . . ,x p ) A 
f\ t < p D( Xi ), it holds: <p(S) = <p'{S') 

Lemma 11 Computing S' from S can be done in linear time O p ^{\S\). 
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Proof. As computing S' from S is easy, one has only to compare the size of the two 
structures. The size of S is: 

q q 
\S\ = @(\D\ + card{Ri).arity{Ri)) = @ P (\D\ + ^ card(Ri)). 

i=l i=l 

For S', by construction, it holds that: 

<? 

\D'\ = (d+l).\D\ + ^2card(Ri) = & d , p (\S\). 

i=l 

Hence, |«S'| = ®{m\D'\) = Q d)P {\S\). □ 
We are now ready to state and prove the main result of this section. 
Theorem 12 QuERY(FODeg) belongs to Constant-Del AY^„ . 

Proof. Let A be a constant delay algorithm that computes queries of QUERY (FOBij). 
By using Proposition 10, the algorithm below correctly evaluates queries in QUERY (FOoeg)- 

Algorithm 4 Evaluating QuERY(FC>Deg) 
1: Input: S, d, cp 

2: Compute the u-formula (f'(x) associated to ip (and d) 

3: Compute the bijective cr-structure S' associated to S (and d) 

4: Run A on input 5', ip' 

The cost of instruction 2 is 0^(1), that of instruction 3 is 0^(151) (by Lemma 11) and 
the precomputation part of algorithm A (included in instruction 4) is 0^/(15'!) (hence 
O^ddiSD) by Theorem 7. These steps form a precomputation phase of time complexity 
0<^,rf(|<5|). Finally, the effective enumeration of </?(<S) = ^p'{S') is handled on S', ip' by A 
and is performed with constant delay. □ 

3.3 Complexity of subgraphs problems 

In this part, we present a simple application of our result to a well-known graph problem. 
Given two graphs G = (V;E) and H = (Vh;Eh), H is said to be a subgraph (resp. 
induced subgraph) of G if there is a one-to-one function g from Vh to V such that, for 
all u, v € Vh, E(g(u), g(v)) holds if (resp. if and only if) Eh(u,v) holds. 

GENERATE SUBGRAPH (resp. GENERATE INDUCED SUBGRAPH) 

Input: any graph H and a graph G of degree bounded by d 
Parameter: \H\,d. 

Output: All the subgraphs (resp. induced subgraphs) of G isomorphic to H. 

The treewidth of a graph G is the maximal size of a node in a tree decomposition of G 
(see, for example, [DF99]). In [PV90] it is proved that for graphs H of treewidth at most 
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w, testing if a given graph H is an induced subgraph of a graph G of degree at most d 
can be done in time f(\H\,d).\G\ w+ . In what follows, we show that there is no reason 
to focus on graphs of bounded treewidth and that a better bound can be obtained for 
any graph H (provided G is of bounded degree). In the result below, we prove that not 
only the complexity of this decision problem is f(\H\,d).\G\ but that generating all the 
(induced) subgraphs isomorphic to H can be done with constant delay. 

Corollary 13 The problem GENERATE SUBGRAPH (resp. GENERATE INDUCED SUBGRAPH) 
belongs to Constant-Delay^ 

Proof. The proof is given for the erate geinduced subgraph problem. Let G = (V;E) 
and H = ( Vh = {hi , • • ■ , h^}; Eh) (\Vh\ = k) be the two inputs of the problem. Since G 
is of maximum degree d, we can partition its vertex set V into d sets V° , . . . , V d where 
each V a is the set of vertices of degree a. This can be done in linear time 0(|G|). We 
proceed the same for graph H and obtain the sets V H , . . . , V H . In case there exists a 
vertex in H of degree greater than d, it can be concluded immediately that the problem 
has no solution. Now, let Q be the following formula: 

Q(xi,...,x k ) = f\ Xi^xjA f\ Vg(xi)f\ f\ E(xi,Xj). 

i<j<k Vg(hi) E H (hi,hj) 

Formula Q simply checks that H is a subgraph of G and that each distinguished vertex 
Xi of G has the same degree as its associated vertex hi in H. Note that formula Q only 
depends on H and d. The result follows now from Theorem 12. □ 

4 Conclusion 

In this paper, we study the complexity of evaluating first-order queries on bounded 
degree structures and consider this evaluation as a dynamical process, i.e., as an enu- 
meration problem. Our main contributions are two-fold. First, we define a simple 
quantifier elimination method suitable for first-order formulas which have to be eval- 
uated against a bijective structure. Second, we define a new complexity class, called 
CONSTANT-DELAY;j n , for enumeration problem which can be seen as the minimal ro- 
bust complexity class for this kind of problems and we prove that our query problem on 
bounded degree structures belong to this class. 

There are several interesting directions for further researches. Among them, the two 
following series of questions seem worth to be studied: 

• Which "natural" query problems belong to CONSTANT- DEL AY^ n ? More gen- 
erally, which kind of combinatorial or algorithmic enumeration problems admit 
constant delay procedures ? 

The same questions can be asked for the larger class CONSTANT-DELAY^,, of 
constant delay enumeration problems for which polynomial time (instead of linear 
time) precomputations are allowed. 
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• What are the structural properties of the class CONSTANT-DELAY/j n or of the 
larger CONSTANT-DELAY^,, ? Do they have complete problems ? Under which 
kind of reductions ? Could they be proved to be different from the classes of 
enumeration problems solvable with linear or polynomial delay ? 

Acknowledgment. We thank Ron Fagin for a very fruitful email exchange that lead 
us to define complexity notions about constant delay computation. 
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